A naïve, salience-based method for speaker identification in fiction books

نویسندگان

  • Kevin Glass
  • Shaun Bangay
چکیده

This paper presents a salience-based technique for the annotation of directly quoted speech from fiction text. In particular, this paper determines to what extent a naïve (without the use of complex machine learning or knowledge-based techniques) scoring technique can be used for the identification of the speaker of speech quotes. The presented technique makes use of a scoring technique, similar to that commonly found in knowledge-poor anaphora resolution research, as well as a set of hand-coded rules for the final identification of the speaker of each quote in the text. Speaker identification is shown to be achieved using three tasks: the identification of a speech-verb associated with a quote with a recall of 94.41%; the identification of the actor associated with a quote with a recall of 88.22%; and the selection of a speaker with an accuracy of 79.40%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparison of Relationship between Text and Picture in the Selected Iranian and Contemporary American-European Illustrated-Fiction Books Based on the Theory of Maria Nikolajeva and Carole Scott

Illustrated-fiction books are special forms of art that are the combination of text and picture. The relationship between text and picture in this genre is diverse and variegated, and has different effects on the audience; however, little research has been done about it. The goal of this research is to compare text/picture relationship in the selected Iranian and contemporary American-European ...

متن کامل

A Naïve De-lambing Method for Speaker Identification

This paper addresses the issue of close-set text-independent speaker identification from speech samples recorded over telephone. We have known that the speaker identification performance variability can be attributed to many factors. One major factor is the inherent differences in the recognizability of different speakers. In speaker recognition systems such differences are characterized by the...

متن کامل

Analyzing the Content of Adolescents Stories in terms of Identification Dimensions, Based on Erikson, Marcia and Berzonski's theories

Background and Aim: The adolescence period is one of the most important stages in the life of each individual, and the basic component of this period is identity. So far, there have been different views about this period of life. One of these is the psychosocial theory of Ericsson, which defined the crisis as "identity against the confusion of the role". In addition, other people like Marcia an...

متن کامل

The System of Engagement in a Sample of Prose Fiction and the News

Emerging within Systemic Linguistics, Appraisal/Evaluation is a framework for analyzing the language of evaluation, providing techniques for the systematic analysis of evaluation and stance as they operate in whole texts and in groupings of texts. There are three systems in the Appraisal framework: Attitude, Engagement, and Graduation. This study sets out to analyze the use of the system of Eng...

متن کامل

Salience Theory and Pricing Stock of Corporates in Tehran Stock Exchange

How the investors react to the received information plays a crucial role in determining the return of stock exchange market. Supply and demand based upon incorrect decisions lead to the price deviation of inherent values. This paper aims to study the impact of salience phenomenon on disproportionate pricing and investor overreaction in the corporates in Tehran stock exchange. Research methodolo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007